Exploring the nonlinear geometry of protein homology.
نویسندگان
چکیده
The explosion of biological data resulting from genomic and proteomic research has created a pressing need for data analysis techniques that work effectively on a large scale. An area of particular interest is the organization and visualization of large families of protein sequences. An increasingly popular approach is to embed the sequences into a low-dimensional Euclidean space in a way that preserves some predefined measure of sequence similarity. This method has been shown to produce maps that exhibit global order and continuity and reveal important evolutionary, structural, and functional relationships between the embedded proteins. However, protein sequences are related by evolutionary pathways that exhibit highly nonlinear geometry, which is invisible to classical embedding procedures such as multidimensional scaling (MDS) and nonlinear mapping (NLM). Here, we describe the use of stochastic proximity embedding (SPE) for producing Euclidean maps that preserve the intrinsic dimensionality and metric structure of the data. SPE extends previous approaches in two important ways: (1) It preserves only local relationships between closely related sequences, thus allowing the map to unfold and reveal its intrinsic dimension, and (2) it scales linearly with the number of sequences and therefore can be applied to very large protein families. The merits of the algorithm are illustrated using examples from the protein kinase and nuclear hormone receptor superfamilies.
منابع مشابه
Inversion of Gravity Data by Constrained Nonlinear Optimization based on nonlinear Programming Techniques for Mapping Bedrock Topography
A constrained nonlinear optimization method based on nonlinear programming techniques has been applied to map geometry of bedrock of sedimentary basins by inversion of gravity anomaly data. In the inversion, the applying model is a 2-D model that is composed of a set of juxtaposed prisms whose lower depths have been considered as unknown model parameters. The applied inversion method is a nonli...
متن کاملNonlinear Vibration Analysis of a cantilever beam with nonlinear geometry
Analyzing the nonlinear vibration of beams is one of the important issues in structural engineering. According to this, an impressive analytical method which is called Modified Iteration Perturbation Method (MIPM) is used to obtain the behavior and frequency of a cantilever beam with geometric nonlinear. This new method is combined by the Mickens and Iteration methods. Moreover, this method don...
متن کاملAnalyzing the geometry of Iranian Islamic gardens based on the Quran’s characteristics of paradise
Iranian Islamic gardens like almost every cultures, represent beauty and happiness and improve the public perception. It has also special geometry with philosophical concept related to Islam’s doctrine that is the focus of this research. Following Quran’s contents, paradise is a beautiful sophisticated garden that something flows under its trees. So the comparison between the somatic geometry o...
متن کاملExploring the Efficiency of Dampers for Repair and Strengthening of Existing Buildings
In this paper, seismic behavior of the existing buildings equipped by friction dampers is studied. Seismic performance of6-story, 9-story and 12-story steel buildings with damper and without damper were studied. The finite element modeling technique (SAP2000 Software) is used for analysis. Time History analyzing was done to achieve this purpose. For nonlinear dynamic analysis, the responses of ...
متن کاملRecurrent metrics in the geometry of second order differential equations
Given a pair (semispray $S$, metric $g$) on a tangent bundle, the family of nonlinear connections $N$ such that $g$ is recurrent with respect to $(S, N)$ with a fixed recurrent factor is determined by using the Obata tensors. In particular, we obtain a characterization for a pair $(N, g)$ to be recurrent as well as for the triple $(S, stackrel{c}{N}, g)$ where $stackrel{c}{N}$ is the canonical ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Protein science : a publication of the Protein Society
دوره 12 8 شماره
صفحات -
تاریخ انتشار 2003